AITopics | part model

CycleSL: Server-Client Cyclical Update Driven Scalable Split Learning

Wang, Mengdi, Bozkir, Efe, Kasneci, Enkelejda

arXiv.org Artificial Intelligence, Nov-25-2025

Split learning emerges as a promising paradigm for collaborative distributed model training, akin to federated learning, by partitioning neural networks between clients and a server without raw data exchange. However, sequential split learning suffers from poor scalability, while parallel variants like parallel split learning and split federated learning often incur high server resource overhead due to model duplication and aggregation, and generally exhibit reduced model performance and convergence owing to factors like client drift and lag. To address these limitations, we introduce CycleSL, a novel aggregation-free split learning framework that enhances scalability and performance and can be seamlessly integrated with existing methods. Inspired by alternating block coordinate descent, CycleSL treats server-side training as an independent higher-level machine learning task, resampling client-extracted features (smashed data) to mitigate heterogeneity and drift. It then performs cyclical updates, namely optimizing the server model first, followed by client updates using the updated server for gradient computation. We integrate CycleSL into previous algorithms and benchmark them on five publicly available datasets with non-iid data distribution and partial client attendance. Our empirical findings highlight the effectiveness of CycleSL in enhancing model performance.
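The cyclical update order described in this abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes scalar linear models (each client computes a feature `a * x`, the server predicts `s * feature`), a squared-error loss, and plain gradient descent. The function name `cycle_round` and all parameters are hypothetical.

```python
import random

def cycle_round(client_weights, server_weight, client_data,
                lr=0.01, batch_size=8, seed=0):
    """One toy round: server update first, then client updates.

    Assumption: scalar linear client and server models with squared error.
    """
    rng = random.Random(seed)

    # Step 1: every client extracts features ("smashed data") locally.
    pooled = [(a * x, y)
              for a, data in zip(client_weights, client_data)
              for x, y in data]

    # Step 2: the server resamples the pooled features and is optimized first.
    batch = rng.sample(pooled, min(batch_size, len(pooled)))
    grad_s = sum((server_weight * z - y) * z for z, y in batch) / len(batch)
    new_server = server_weight - lr * grad_s

    # Step 3: each client is updated with gradients computed through the
    # already-updated server weight. No server copies are aggregated.
    new_clients = []
    for a, data in zip(client_weights, client_data):
        grad_a = sum((new_server * a * x - y) * new_server * x
                     for x, y in data) / len(data)
        new_clients.append(a - lr * grad_a)

    return new_clients, new_server
```

Calling `cycle_round([1.0, 1.0], 0.5, [data, data])` with `data = [(1, 2), (2, 4), (3, 6)]` moves both the server weight and the client weights toward fitting `y = 2x`. The point of the sketch is only the ordering: a single server model is trained on resampled features before any client gradient is computed.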

artificial intelligence, cyclesl, machine learning, (15 more...)

2511.18611

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Interpreting Transformers for Jet Tagging

Wang, Aaron, Gandrakota, Abhijith, Ngadiuba, Jennifer, Sahu, Vivekanand, Bhatnagar, Priyansh, Khoda, Elham E, Duarte, Javier

arXiv.org Artificial Intelligence, Dec-8-2024

Machine learning (ML) algorithms, particularly attention-based transformer models, have become indispensable for analyzing the vast data generated by particle physics experiments like ATLAS and CMS at the CERN LHC. Particle Transformer (ParT), a state-of-the-art model, leverages particle-level attention to improve jet-tagging tasks, which are critical for identifying particles resulting from proton collisions. This study focuses on interpreting ParT by analyzing attention heat maps and particle-pair correlations on the $\eta$-$\phi$ plane, revealing a binary attention pattern where each particle attends to at most one other particle. At the same time, we observe that ParT shows varying focus on important particles and subjets depending on decay, indicating that the model learns traditional jet substructure observables. These insights enhance our understanding of the model's internal workings and learning process, offering potential avenues for improving the efficiency of transformer architectures in future high-energy physics applications.

artificial intelligence, machine learning, particle, (18 more...)

2412.03673

Country:

North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > California > San Diego County > San Diego (0.05)
North America > United States > Illinois > Kane County > Batavia (0.04)
North America > United States > California > San Diego County > La Jolla (0.04)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

MicroT: Low-Energy and Adaptive Models for MCUs

Huang, Yushan, Aloufi, Ranya, Cadet, Xavier, Zhao, Yuchen, Barnaghi, Payam, Haddadi, Hamed

arXiv.org Artificial Intelligence, Jul-9-2024

We propose MicroT, a low-energy, multi-task adaptive model framework for resource-constrained MCUs. We divide the original model into a feature extractor and a classifier. The feature extractor is obtained through self-supervised knowledge distillation and further optimized into part and full models through model splitting and joint training. These models are then deployed on MCUs, with classifiers added and trained on local tasks, ultimately performing stage-decision for joint inference. In this process, the part model initially processes the sample, and if the confidence score falls below the set threshold, the full model will resume and continue the inference. We evaluate MicroT on two models, three datasets, and two MCU boards. Our experimental evaluation shows that MicroT effectively improves model performance and reduces energy consumption when dealing with multiple local tasks. Compared to the unoptimized feature extractor, MicroT can improve accuracy by up to 9.87%. On MCUs, compared to the standard full model inference, MicroT can save up to about 29.13% in energy consumption. MicroT also allows users to adaptively adjust the stage-decision ratio as needed, better balancing model performance and energy consumption. Under the standard stage-decision ratio configuration, MicroT can increase accuracy by 5.91% and save about 14.47% of energy consumption.
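The stage-decision step in this abstract (part model first, full model only when confidence is low) can be sketched as follows. This is a minimal illustration, not MicroT's code: `part_model` and `full_model_rest` are hypothetical callables, the confidence score is assumed to be the maximum softmax probability, and the default threshold of 0.8 is an arbitrary placeholder.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def stage_decision(sample, part_model, full_model_rest, threshold=0.8):
    """Run the part model; continue with the full model if confidence is low.

    part_model(sample) -> (intermediate_features, logits)   # hypothetical
    full_model_rest(intermediate_features) -> logits        # hypothetical
    """
    features, logits = part_model(sample)
    probs = softmax(logits)
    confidence = max(probs)
    if confidence >= threshold:
        # Early exit: the cheaper part model is confident enough.
        return probs.index(confidence), "part"
    # Otherwise resume from the part model's features with the full model.
    probs = softmax(full_model_rest(features))
    return probs.index(max(probs)), "full"
```

Raising `threshold` sends more samples to the full model, which is one way to read the adjustable stage-decision ratio mentioned above: it trades energy for accuracy.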

energy consumption, feature extractor, inference, (15 more...)

2403.0804

Country:

Europe > United Kingdom > England > Greater London > London (0.04)
North America > United States > California (0.04)
Europe > United Kingdom > England > North Yorkshire > York (0.04)

Genre: Research Report > New Finding (0.67)

Industry:

Energy (1.00)
Education (0.94)
Information Technology (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Hardware (0.94)
Information Technology > Communications > Mobile (0.93)
(5 more...)

4-Dimensional deformation part model for pose estimation using Kalman filter constraints

Martinez-Berti, Enrique, Sanchez-Salmeron, Antonio-Jose, Ricolfe-Viala, Carlos

arXiv.org Artificial Intelligence, Feb-7-2024

The main goal of this article is to analyze the effect on pose estimation accuracy of adding a Kalman filter to 4-dimensional deformation part model partial solutions. Experiments on two data sets show that this method improves pose estimation accuracy compared with state-of-the-art methods and that the Kalman filter helps to increase this accuracy.

estimation, pose estimation, yang and ramanan, (16 more...)

2402.04953

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Spain > Valencian Community > Valencia Province > Valencia (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
(10 more...)

Genre:

Research Report > Promising Solution (0.48)
Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Vision > Video Understanding (0.94)

Learning a discriminative hidden part model for human action recognition

Neural Information Processing Systems, Apr-6-2023, 14:31:21 GMT

We present a discriminative part-based approach for human action recognition from video sequences using motion features. Our model is based on the recently proposed hidden conditional random field (hCRF) for object recognition. Similar to hCRF for object recognition, we model a human action by a flexible constellation of parts conditioned on image observations. Different from object recognition, our model combines both large-scale global features and local patch features to distinguish various actions. Our experimental results show that our model is comparable to other state-of-the-art approaches in action recognition.

human action recognition, part model, recognition, (1 more...)

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Vision (1.00)

Part-Based Models Improve Adversarial Robustness

Sitawarin, Chawin, Pongmala, Kornrapat, Chen, Yizheng, Carlini, Nicholas, Wagner, David

arXiv.org Artificial Intelligence, Mar-8-2023

We show that combining human prior knowledge with end-to-end learning can improve the robustness of deep neural networks by introducing a part-based model for object classification. We believe that the richer form of annotation helps guide neural networks to learn more robust features without requiring more samples or larger models. Our model combines a part segmentation model with a tiny classifier and is trained end-to-end to simultaneously segment objects into parts and then classify the segmented object. Empirically, our part-based models achieve both higher accuracy and higher adversarial robustness than a ResNet-50 baseline on all three datasets. For instance, the clean accuracy of our part models is up to 15 percentage points higher than the baseline's, given the same level of robustness. Our experiments indicate that these models also reduce texture bias and yield better robustness against common corruptions and spurious correlations. The code is publicly available at https://github.com/chawins/adv-part-model.

accuracy, artificial intelligence, machine learning, (19 more...)

2209.09117

Country:

North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Information Technology (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Learning a discriminative hidden part model for human action recognition

Wang, Yang, Mori, Greg

Neural Information Processing Systems, Feb-15-2020, 03:56:07 GMT

human action recognition, part model, recognition, (1 more...)

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Vision (1.00)

An approach to identify design and manufacturing features from a data exchanged part model

#artificialintelligence, Sep-15-2003

Due to the large variety of CAD systems on the market, data exchange between different CAD systems is indispensable. Currently, data exchange standards such as STEP and IGES provide a unique approach for interfacing among different CAD platforms. Once a feature-based CAD model created in one CAD system is imported into another via data exchange standards, many of the original features and the feature-related information may no longer exist. The identification of the design features, and their further decomposition into machining features for downstream activities, from a data exchanged part model is a bottleneck in integrated product and process design and development. In this paper, the feature panorama is succinctly articulated from the viewpoint of product design and manufacturing.

identification, identify design and manufacturing feature, part model, (3 more...)

Technology:

Information Technology > Data Science > Data Integration (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
